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Abstract —The molecular weight of sugar in blood is an 
important index to recognize pancreatic cancer. The molecular 
weight data of sugar in the blood of pancreatic cancer were 
collected, the statistical mathematical model was established, the 
mean variance cancer recognition method was constructed, and 
the mean and mean variance of molecular weight were 
calculated. The calculation results are given in chart form. The 
analysis of the results showed that the variance of patient and 
health data of mean and mean variance was completely 
separated, and the recognition rate of pancreatic cancer was as 
high as 86%. 

Index Terms —Pancreatic cancer, blood, sugar, statistics. 

I. Introduction 

Cancer leads to death, cancer patients often have abnormal 
glycosylation, one of which is abnormal N-glycan. Studying 
the abnormal expression of N-glycan is an important way to 
identify cancer. In the serum of cancer patients, N-glycan has 
significant abnormal expression [1]. Therefore, in recent 
years, N-glycan in serum is a hot spot in cancer research. 
Rudd et al. Pointed out that N-glycan is an important cancer 
molecular marker, especially suitable for early diagnosis, 
stage recognition and prognosis monitoring of cancer [2]. A 
large number of studies have shown that in the serum of 
cancer patients, the expression of N-glycan is significantly 
abnormal [3]. For example, in the study of breast cancer and 
lung cancer, abnormal sialylation and fucosylation were 
found [4] [5]; in the study of liver cancer, the change of 
glycosyltransferase can cause the change of N-glycan content 
[6]. Similar phenomena have been found in prostate cancer, 
colorectal cancer and other cancers [7] [8]. However, while 
the abnormal expression of N-glycan caused by cancer has 
been widely concerned, the relationship between N-glycan 
and cancer stage is rarely reported. 

The survival rate and poor treatment rate of pancreatic 
cancer are the most challenging problems in biomedical and 
clinical fields. How to improve the long-term survival rate 
and prognosis of patients with pancreatic cancer has become 
an urgent problem[9]. 

In this paper, we will study the molecular weight of sugar 
in the blood of pancreatic cancer, and construct the mean 
variance pancreatic cancer recognition method with statistical 
theory. 


weight measurement data of the k th pancreatic cancer patient 

is Ujk, k = 1,2,... K. The sum of the measurement data of each 
patient is 
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Divide the measured data by this number, and then express 
it in percentage 
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( 1 ) 


If the patient measurement data is set as one row and the 
k-th patient measurement data is set as the k-th row, then the 
k-th patient data will form the k-th row j-column table. The 
above formula is the average value of the data in column J of 
the table. 

In the experiment, the molecular weight of glucose in the 
blood of 27 patients with pancreatic cancer was measured. 
The total molecular weight was 47 and 1269 data were 
obtained. Figure 1 shows part of the calculated data of the 
mean molecular weight of patients with pancreatic cancer. 
According to the calculation data, when the molecular weight 
is 2792 



II. Average statistics 

III. Variance statistics 

Let u be the molecular weight of the sugar in the blood of The calculation formula of mean square deviation is: 
pancreatic cancer. The value is Uj, j = 1,2,..., J. the molecular r~j ^ 

Dxj = J-£ (x jk ~ Exjf j = 1,2,...,J 
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( 2 ) 

Figure 2 shows the calculation data of molecular weight 
mean square deviation of pancreatic cancer patients. It can be 
seen from the figure that When the molecular weight is 
1998.9, the minimum corresponding mean square deviation is 
0.033291. 



IV. Variance of mean 

Let x(j,k)= Xjk Is the j-th molecular weight measurement 
data of the k-th individual, Let transformation: 

y(j, k ) = ^r ^-1 k = 1,2,.., KJ = 1,2,.. j 

E x j 

here 

1 K 

Exj = —zl x j k j = l,2,...,J 

E k=l 

The mean of y(j,k) is 

z(j) = Ey(j ) = y(k, j ) j = 1,2,..., J 

E k=l (D\ 



Fig.3, Mean square deviation of patients and healthy 
people, patient triangle. Pentagram of healthy people 


Then calculate the mean square deviation of {Z (1), Z 
(2),..., Z (P)} of the first P data, and record it as Dy (P), P = 


2 3 J 

According to the measurement data, according to the 
measurement data of patients and healthy people, the variance 
is calculated, and the results are shown in Figure 3. The 
patient triangle in the figure. Healthy person pentagram. It can 
be seen from the figure that the mean square deviation of 
patients and healthy people is completely separated. 

V. Pancreatic cancer recognition 

The above steps of calculating mean square deviation are 

1) To collect the molecular weight data of glucose in the 
blood of pancreatic cancer ujk, j=l,2,..,Jk=l,2,..K. 

2) Calculate the sum of each person's measurements 

K 

Sk ~ ^ 1 u jk k — 1 , 2 ,••* 9 K 

k =1 

3 ) Measured data in percentage 

X kj =^JL*10 o k = 1,2,..., K,j = 1,2,../ 

N- 

4) Calculate the j-th molecular weight average: 

1 K 

Exj = — X x jk y=l,2,...,/ 

E k=l 

5) Let x(j,k)= xjk is the measurement data of the j-th 
molecular weight of the k-th person, Let transformed 

y(j, k ) = ELE- 1 £ = 1,2,..,K; j = 1,2,.../ 

E x j 

6) The mean of y(j,k) are 

z(j) = Eyi.j) = y(k, j ) j = 1,2,..., J 

E k =i 

7) Then, the mean square deviation of {Z (1), Z (2),..., Z 
(P)} of the first P data is calculated, which is recorded as Dy 

(P). 

Using the above mean square deviation calculation results, 
the following gives the pancreatic cancer recognition method. 
Set up the molecular weight data set of sugar in the blood of 
known pancreatic cancer patients U={ujk, j=l,2,..,J, 
k=l,2,..K}, Molecular weight data set of sugar in blood of 
healthy people V={vjk, j=l,2,..,n, k=l,2,..m}, Molecular 
weight data set of sugar in the blood of cancer patients to be 
examined is A={vjk, j=l,2,..,n, k=l}. The steps of 
recognition of pancreatic cancer are: 

1) The mean square deviation of known pancreatic cancer 
patients was calculated by using the data in set U, which was 
recorded as Dy (P,b). The mean square deviation of known 
healthy people is calculated by the data in set V, which is 
recorded as Dy (P,a); 

2) Consider to be check data in set A, Add the data of A to 
U and V, and calculate the mean square deviation with the 
data in the set (U + A), which is recorded as Dy (P,b '). 
Calculate the mean square deviation with the data in the set (V 
+ A), and record it as Dy (P,a '); 

3) Calculate the difference valueldy (P,b) - Dy (P,b') I, and 
the difference valueldy (P,a) - Dy (P,a f ) I, wherelxlis the 
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absolute value of x. 

4) If E I Dy (P,b) - Dy (P,b ') I < E I Dy (P, a) - Dy (P, a') I, 
the subject is a cancer patient, otherwise it is a healthy person. 
Where D is the sum of all data. 

VI. Calculation results and discussion 

The above mean variance method is used to calculate the 
statistical value of 51 people. The results are listed in table 1. 
According to the data in the table, the correct probability of 
identifying cancer patients is (26-4) / 26 = 84.6%, and the 
correct probability of identifying healthy people is (25-3) / 25 
= 88%. The overall correct rate is (51-7) / 51 = 86%. The 
highest correct probability of traditional classification method 
is 43% [10]. 
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Table 1 Calculation results 


class 

Accuracy Healthy 
rate sample 

Pancreatic cancer 
samples 

Healthy 

88% 22 

3 

sample 

Pancreatic 

84.60% 4 

22 

cancer samples 

total 

86.30% 26 

25 


VII. Conclusion 

This paper studies the recognition of pancreatic cancer. 
After analyzing the measurement data, we found that the 
measurement data of pancreatic cancer patients fluctuated 
greatly, so we constructed the mean variance recognition 
method. This method first calculates the mean value of the 
measure data, and then calculates the mean square deviation 
of the mean value. According to the mean variance method, 
the calculation shows that the mean variance of pancreatic 
cancer patients is 100% separated from the mean variance of 
healthy people. The results show that the recognition rate of 
pancreatic cancer is as high as 86%. 
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